Picture for Yuqian Yuan

Yuqian Yuan

VisualThink-VLA: Visual Intermediate Reasoning for Effective and Low-Latency Vision-Language-Action Policies

Add code
May 28, 2026
Viaarxiv icon

InstructSAM: Segment Any Instance with Any Instructions

Add code
May 25, 2026
Viaarxiv icon

CrossView Suite: Harnessing Cross-view Spatial Intelligence of MLLMs with Dataset, Model and Benchmark

Add code
May 18, 2026
Viaarxiv icon

LMMs Meet Object-Centric Vision: Understanding, Segmentation, Editing and Generation

Add code
Apr 13, 2026
Viaarxiv icon

RynnBrain: Open Embodied Foundation Models

Add code
Feb 13, 2026
Viaarxiv icon

MAU-GPT: Enhancing Multi-type Industrial Anomaly Understanding via Anomaly-aware and Generalist Experts Adaptation

Add code
Jan 31, 2026
Viaarxiv icon

Unified Personalized Understanding, Generating and Editing

Add code
Jan 11, 2026
Viaarxiv icon

AnyMS: Bottom-up Attention Decoupling for Layout-guided and Training-free Multi-subject Customization

Add code
Dec 29, 2025
Viaarxiv icon

RynnEC: Bringing MLLMs into Embodied World

Add code
Aug 19, 2025
Viaarxiv icon

EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?

Add code
Jun 05, 2025
Figure 1 for EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?
Figure 2 for EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?
Figure 3 for EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?
Figure 4 for EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?
Viaarxiv icon